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The invention relates to a time-stationaiy processor acianged for execution of a 
piogram, the processor comprising: a plurality of execution units, a register file accessible by 
the execution units, a conomunication network for coupling tibe execution units and the 
register file, and a controller arranged for controlling the processor based on control 

5 information derived fix>m the program. 

The mvention fbrfher relates to a method for controlling a time-stationary 
processor arranged for execution of a program, wherein the processor comprises: a plurality 
of execution units, a register file accessible by the execution units, a communication network 
for coupling the execution units and the register file, and a controller arranged for controlling 

10 the processor based on control information derived firom the program. 



Digital signal processing plays an important role in flie telecommunications, 
multimedia and consumer electronics industries. For performing the operations involved in 

15 digital signal processing, a special type of processor may be designed, referred to as a digital 
signal processor. Digital signal processors can be programmable processors or application- 
specific instruction-set processors. Programmable processors are general-purpose processors 
and they can be used for manipulating different types of information^ including sound, 
images and video. In case of application specific instraction-set processors, the processor 

20 architecture and instruction set is customized, which reduces the system's cost and power 
dissipation significantiy. The latter is cracial for portable and networic powered equipment 

Digital signal processor ardiitectures consist of a fixed data path, which is 
controlled by a set of control words. Each control word controls parts of the data path and 
these parts may conprise register addresses and operation codes for arithmetic logic units 

25 (ALUs) or other functional units. Each set of instructions generates a new set of control 

words, usually by means of an instruction decoder which translates the binary format of the 
instruction into tiie corresponding control word, or by means of a micro store, i.e. a memory 
which contains the control words direcfly. Typically, a control word represents a RISC like 
operation, con^rising an operation code, two operand register indices and a result register 
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index. The operand register indices and the result register index refer to registers in a register 
file. 

A Very Large Instruction Word (VLIW) processor is often used for digital 
signal processing. In case of a VLIW processor, multiple instructions are packaged into one 
5 long instruction,' a so-called V Li W instruction. A VLIW proc^'sor uses multiple, 

independent execution units to execute these mult^le instructions in parallel. The processor 

4 

allows exploiting instruction-level parallelism in programs and thus executing more than one 
instruction at a time. Due to this form of concurrent processing, the performance of the 
processor is increased. Jn order for a software program to run on a VLIW processor, it must 

10 be translated into a set of VLIW instructions. The compiler attempts to minimize the time 
needed to execute the program by optimizing parallelism. The compiler combines 
instructions into a VLIW instruction under the constraint that the instructions assigned to a 
single VLIW instmction can be executed in parallel and under data dependency constraints. 
The encoding of parallel instructions in a VLIW instruction leads to a severe increase of the 

IS code size. Large code size leads to an increase in program memory cost both in terms of 
required memory size and in terms of required memory bandwidth. In modem VLIW 
processors dil^rent measures are taken to reduce the code size. One important example is the 
compact representation of no operation (NOP) operations in a data stationary VLIW 
processor, i.e. the NOP operations are encoded by single bits in a special header attached to 

20 the firont of the VLIW instruction, resulting in a compressed VLIW instruction. 

To control the operations in the data pipeline of a processor, two different 
mechanisms are commonly used in computer architecture: data-stationary and time-stationary 
^coding, as disclosed in ^'Embedded software in real-time signal processing systems: design 
technologies", G. Goossens, J. van Prael^ D. Lanneer, W. Geurts, A. Kifli, C. liem and P. 

25 Paulin, Proceedings of the EBEE, vol. 85, no. 3, March 1997. In the case of data-stationary 
encoding, evety instmction that is part of the processor's instraction-set controls a complete 
sequmce of operations that have to be executed on a specific data item, as it traverses the 
data pipeline. Once the instruction has been fetohed fix)m program memory and decoded, the 
processor controller hardware will make sure that the composing operations are executed in 

30 the correct machine cycle. In the case of time-stationary coding, every instruction that is part 
of the processor's instmction-set controls a complete set of operations tiiat have to be 
executed in a sin^e machine cycle. These operations may be applied to several different data 
items traversing the data pipeline. In this case it is the responsibility of the programmer or 
compiler to set up and maintain the data pipeline. The resulting pipeline schedule is fully 
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visible in the machine code program. Time-stationaiy encoding is often used in application- 
specific processors, since it saves the overhead of hardware necessary for delaying flie 
control information present in the instructions, at the ejqpense of larger code size. 

It is a disadvantage of time-stationary processors fliat conditional operations, 
i.e. operations that return a result based on a condition computed at tun-time, can not be 
siq)ported. Time-stationary encoding demands tiiat all control information, including the 
write back of results to a register ffle, is statically determined at con5>ile time and encoded in 



the prog 



It is an object of tiie invention to enable the use of conditional execution of 
operations in time-stationaiy processors without the use of jun^ operations, while 
tnatnteiiTimg the advantages of time-stationary encoding. 

This object is achieved with a processor of the kind set forth, characterized in 
tiiat flie processor is fiirflxer actanged to dynamically control tiie transfer of result data fiom 
an execution unit of the pluraUly of execution units to tiie register file, based on flie control 
information. By dynamically controlling the write back of result data to the register file, it • 
can be determined during run-time if the result data of an operation have to be written back to 
tiie register file. As a result, the conditional execution of operations can be inqplemented on a:- 
time-stationary processor, without the use of jump operations. 

An embodiment of tiie invention is characterized in that that the control 
mfbimation comprises an first identifier on tiie vaUdity of an operation, and wherein the 
processor is arranged to dynamically control writing of result data corresponding to tiie 
operation into the register file, based on the first identifier. In case of an invalid operation, ie. 
a so-called NOP operation, no result data have to be written bade to the register file. By usmg 
tiie identifier, the writing back of result data is direcfly disabled in case of an invalid 



operation. 



Ane 







•II 







delayed according to the pipeline of the corresponding execution unit arranged for executing 
tiie operation. By delaying tiie identifier according to the pipeline of the execution unit, the 
information required for determining tiie write back of result data becomes available at tiie 
output of the execution unit at same time as the result data itself 

An embodiment of the invention is characterized in that the execution unit is 
arranged to produce a second identifier on the validity of an ou^ut result of a corr^onding 
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output port of the execution unit, and wherein the processor is fiuther arranged to 
dynamically control writing of result data corresponding to the operation into the register file, 
based on both Ihe first identifier and the second identifier. As a result, operations to be 
executed by tiie execution unit are allowed that potentially produce more than one valid 

5 output. " 

An embodimrat of the invention is characterized in that the processor is 
further arranged to dynamically control writing of result data corresponding to the operation 
into the register file, based on the first identifier, the second identifier and an. input datum. 
The input datum represents a true or a false condition, which can be determined in a separate 
10 execution unit and subsequently used in other fimctional units in order to efficientiy 
implement a guarded operation. 

An embodiment of the invention is characterized in that the register file is a 
distributed register file. An advantage of a distributed register file is that it requires less read 
and write ports per register file segment, resulting in a smaller register file in terms of silicon 
15 area. Furthermore, the addressing of a register in a distributed register file requires less bits 
when conipared to a central register file. 

An embodiment of the invention is characterized in that the communication 
network is a partially connected communication network. A partially coimected 
communication network is often less timing critical and less expensive in terms of code size, 
20 area and power consumption, when compared to a fiiUy coimected communication network, 
especially in case of a large number of execution units. 

According to the invention a method for controlling a processor is 
characterized in that the method for controlling comprises the step of dynamically controlling 
the transfer of result data from an execution unit of the plurality of execution units to the 
25 register file, using the control information. By dynamically controlling the transfer of result 
data to an execution unit, it can be decided at run-time if result data have to be written back 
to the register file, allowing implementing guarded operations by time-stationary encoding. 



30 Figure 1 shows a schematic block diagram of a first VLIW processor 

according to the invention. 

Figure 2 shows a schematic block diagram of a second VLIW processor 
according to the invention. 
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Refening to Fig. 1 and Fig. 2, a schematic block diagram illustrates a VLTW 
processor comprising a plurality of execution units EXl and EX2, and a distributed register 
file, including register file segments RFl and RF2. The register file segments RFl and RF2 
are accessible by execution units EXl and EX2, respectively, for retrieving input data ID 
from the register file. The execution units EXl and EX2 also are coupled to the register file 
segments RFl and RF2 via the communication network CN and multiplexers MPl and MP2, 
for passing result data RDl and RD2 firom said execution units to the distributed register file. 
The controller CTR retrieves instructions from the program memory PM and decodes these 
instructions. In general, these instructions con^rise RISC like operations, requiring only two 
operands and producing only one result, as well as custom operations that can consume more 
than two operands and/or that can produce more than one result Some instructions may 
require small or large immediate values as operand data. Results of the decoding step are the 
write select indices WSl and WS2, write register indices WRl and WR2, read register 
indices RRl and RR2, operation valid iodices OPVl and OPV2, and qpcodes OCX and 002.^ 
Via the couplings between the controller CTR and multiplexers MPl and MP2, the write 
select indices WSl and WS2 are provided to the multiplexers MPl and MP2, respectively. 
The write select indices WSl and WS2 are used by the correq>ondiDg multiplexer for 
selecting the required input chaimel fix>m tiie communication network CN fixr the data WDl = 
and WD2 that have to be written to register file segments RFl and RF2, respectively- The . : 
write select indices WSl and WS2 are also used by the corresponding multiplexer for 
selecting the input channel from the communication network CN for the write enable indices 
WEI and WE2 lhat are used to enable or disable the actual writing of data WDl andWD2to 
the corresponding register ffle segment RFl and RF2. The controller CTR is coupled to the 
register file segments RFl and RF2 for providing the write register indices WRl and WR2, 
respectively, for selecting a register firom the corresponding register file segment to which 
data have to be written. The controller CTR also provides the read register indices RRl and 
RR2 to the register file segments RFl and RF2, respectively, for selecting a register from the 
cotresponding register file segment fix>m which input data ID have to be read by the 
execution units EXl and EX2, respectively. The controller CTR is coupled to the execution 
unite EXl and EX2 as well, for providing Ihe opcodes CX^l and OC2, respectively, that 
define the type of operation that tiie execution unit EXl or EX2 has to perform on the 
corresponding iaput data ID, The operation valid indices OPVl and OPV2 are also provided 
to execution units EXl and EX2, respectively, and these indices indicate if a valid operation 
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is defbaed by the CQnesponding opcode OCl or 0C2. The value of the operation vaKd indices 
OPVl and OPV2 is determined duting decoding of the VLIW instraction. In a prior art time- 
stationary processor, the write enable indices used for enabling or disabling the writing of 
data firom the execution units to the register file, are statically determined, since they are 
5 encoded in the program' at 'cbn:q>ile'time: The controUef bl>tainslhe wri'tFenable indices fcmoT 
the program after decoding, and directiy provides the write enable indices to the register file. 

Referring to Fig. 1, the controller CTR is coupled to registers lOS. The 
controller CTR derives operation valid indices OPVl and OPV2 from the program during the 
decoding step and these operation valid indices are provided to the registers 105. In case the 

10 encoded operation is a NOP operation, the operation valid index is set to false, otherwise the 
operation valid index is set to true. The operation valid indices OPVl and OPV2 are delayed 
according to the pipeline of the corresponding execution unit EXl and EX2 using registers 
105, 107 and 109. After execution of the operations by execution unit EXl and EX2, as 
defined via opcodes OCl and OC2 respectively, the corresponding result data RDl and RD2 

15 as well as the corresponding output vaKd indices OVl and OV2 are produced. The output 
valid index OVl or OV2 is true if Ihe corresponding result data RDl or RD2 are valid, 
otherwise it is &lse. Unit 101 performs a logic AND on the delayed operation valid index 
OPVl and the output valid index OVl, resulting in a result valid index RVl . Unit 103 
performs a logic AND on the delayed operation valid index OP V2 and the output valid index 

20 OV2, resulting in a result valid index RV2. The units 101 and 103 are both coupled to 

multiplexers MPl and MP2, via the partially comiected network CN, for passing the result 
valid indices RVl and RV2 to the multiplexers MPl and MP2. The write select indices WSl 
and WS2 are used by the corresponding multiplexers MPl and MP2 to select a channel fit)m 
the connection network CN fix>m which result data have to be written to Ihe corresponding 

25 register file segment. In case a result data channel is selected by a multiplexer, the result valid 
indices RVl andRV2 are used to set the write enable indices WEI and WE2, for control of 
writing result data RDl and RD2 to the register file segments RFl and RF2, respectively. In 
case multiplexer MPl or MP2 has selected the input channel corresponding to result data 
RDl, result valid RVl is used for setting the write enable index corresponding to that 

30 multiplexer, and in case tiie input channel corresponding to result data RD2 is selected, result 
valid index RV2 is used for setting the corresponding write enable index. If result valid index 
RVl or RV2 is trae, the ^[jpropriate write enable index WEI or WE2is settotraeby the 
corresponding multiplexer MPl and MP2. In case the write enable index WEI orWE2is 
equal to trae, the result data RDl or RD2 are written to the register file segment RFl orRF2, 
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in a register selected via the write register index WRl or WR2 corresponding to that register 
ffle segment. La case the write enable index WEI or WE2 is set to false, thou^ via the 
corresponding write select index WSl or WS2 an input channel for writing data to 
corresponding register ffle segment RFl or RF2 has been selected, no data will be written 
into that register ffle segment In order to disable the write back of any result data KDl or 
RD2 via a given write port of register ffle segments RFl and RF2, respectively, the write 
select index WSl or WS2 corresponding to that register ffle segment can be used to select the 
defeult input 111 ftom the corresponding multiplexer MPl or MP2, in which case no result 
data are written to that register ffle segment 

Referring to Fig. 2, the controller CTR is coupled to logic units 201 and 205. 
The controller CTR retrieve operation valid indices OPVl and OPV2 ftom the program 
during fhs decoding step and these operation valid indices are provided to logic unit 201 and 
205, respectively. In case the encoded operation is a NOP operation, the operation valid 
index is set to felse, otherwise flie operation valid index is set to true. The registCT ffle 
segments RFl and RF2 are coiQ>led to unit 201 and 205 respectively, and the corresponding 
guards GUI and GU2 can be written ftom the register ffle segments RFl and RF2 to the units 
201 and 205, respectively. The guards GUI and GU2 can be either true or felse, depending 
on the outcome of the operation during which the value of that guard was determined. Unite 
201 and 205 perform a logic AND on the corresfponding operation valid mdex OPVl or 
OPV2, and the corresponding guard GUI or GU2. The resulting index is delayed according 
to the pipeline of the corresponding execution unit EXl and EX2 using registers 209, 211 and 
213. After the operation, defined via opcode OCl or OC2, has been executed by execution 
unit EXl and EX2, respectively, the corresponding result data RDl and RD2 as well as the 
corresponding output valid index OVl and OV2 are produced. The output valid indices OVl 
and OV2 are true if the corresponding result data RDl or RD 2 are valid ou^ data, 
otherwise fliey are felse. Unit 203 performs a logic AND on the delayed index, resulting from 
guard GUI and operation vaiid index OPVl, and the output vaHd index OVl, resulting in a 
result valid index RVl. Unit 207 performs a logic AND on the delayed index, resulting from 
guard GU2 and operation valid index OPV2, and the output valid index OV2, resulting in a 
result valid index RV2. The units 203 and 207 are coupled to multiplexers MPl and MP2, 
respectively, via the partially connected network CN, for passing the result valid indices RVl 
and RV2 to multiplexers MPl and MP2. The result valid indices RVl and RV2 are used to 
set the write enable index WEI or WE2 for control of writing r^ult data RDl or RD2 to the 
register ffle segments RFl and RF2. The write sdiect indices WSl and WS2 are used by the 
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cortespQiiding multipliers MPl and MP2 to select a channel from the connection network 
CN from which resiilt data have to be written to the corresponding register file segment In 
case a result data channel is selected by a multiplexer, the result valid indices RVl and RV2 
are used to set the write enable indices WEI and WE2, for control of writing result data RDl 
5 and RD2 to the regiistSf file'segmSttts RFl and RF2, res^ecHvely. In case multiplex^ MPl oT 
MP2 has selected the input channel corresponding to result data RDl, result valid RVl is 
used for setting the write enable index corresponding to that multiplexer, and in case the 
input channel corresponding to result data RD2 is selected, result valid index RV2 is used for 
settmg the corresponding write enable index. If result valid index RVl or RV2 is true, the 

10 ^propriate write enable index WEI or WE2 is set to true by the corresponding multiplexer 
MPl andMP2. In case the write enable index WEI or WE2 is equal to trae, flie result data 
RDl or RD2 are written to the register file segment RFl or RF2, in a register selected via the 
write register index. WRl or WR2 corresponding to that register file segment In case the 
write enable index WEI or WE2 is set to felse, though via the corresponding write select 

15 index WSl or WS2 an input chaonel for writing data to corresponding register Sle segment 
RFl or RF2 has been selected, no data will be written into that register file segment. In order 
to disable the write back of any result data RDl or RD2 via a given write port of register file 
segments RFl and RF2, respectively, the write select index WSl or WS2 corresponding to 
that register file segment can be used to select the de&ult icput 111 fit>m the corresponding 

20 multiplexer MPl or MP2, in which case no result data are written to that register file 
segment. 

The time-stationary VLIW processors according to Fig. 1 and Fig. 2 allow 
dynamically controlling the write back of result data to the register file. It can be determined 
during run-time if the result data of an operation that has been executed have to be writtm 
25 back to the register file. As a result, conditional operations can be implemented by a 
processor using time-stationary encoding of instructions. 

Below an example of a piece of program code is shown, that should be 
executed by a time-stationary processor according to the invention. In this program code the 
letters A, BO, Bl, B2, CO, CI and D refer to statements and X to a condition that can either be 
30 false or trae. 



A; 
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if (5Q then 
{ 

B0;B1;B2; 
} 

dse 

{ 

CO; CI; 
} 

D; 



The program code can be executed by a prosessor according to Fig. 2 as 
foUows. The program code is converted by the comqwler using a weU-known technique caUed 
"if conversion", whicb allows the execution of if-then-else bodies without the need for costly 
branching. Because of this, it even allows the parallel execution of "if-then-else" bodies by 
ensuring that eilJier the "then" or the "else" body returns results based on the 'If condition or: 
its complement used as guard for the instruction(s) in the "then" and "else" bodies. Using "if 
conversion" the above shown piece of program code is converted to: 

* 



A; 



if (X): BO; 
if C5Q: Bl; 
if(JQ:B2; 
if (DC):CO; 
if (GQ: CI; 
D; 



Referring to Fig. 2, an instmction is executed by either execution unit EXl or 
EX 2 to determine the value of condition X. This instruction produces the result "trae", and 
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this result is stored in register file segment KPl and its complement, i.e. the result '"fiilse", is 
stored in register file segment RF2. Next, execution unit EXl executes instructions 
comprising statements BO, Bl and B2, and execution unit EX2 executes instructions 
comprising statements CO and CI. Because of the removal of the control flow in the if- 
S converted pix)g]mQ7wliich is normally implemented using jump operations and therefore 
sequential in nature, operations in the ^^en" and ^'else" bodies of the original program can 
now be scheduled in parallel, if data dependenci^ and availability of resources permit to do 
so. The controller CTH decodes the VLIW instmction, and sends the resulting write select 
indices WSl and WS2 to the corresponding multiplexers MPl and MP2, the write register 

10 indices WRl and WR2 as well as read register indices RRl and RR2 to the corresponding 
register file segments RFl and KF2, the operation codes OCl and OC2 to the corresponding 
execution units EXl and EX2 and the operation valid indices OPVl and OPV2 to the 
corresponding unit 201 and 205. These operation valid indices OPVl and OPV2 are equal to 
*true". The units 201 and 205 also receive the result of the evaluation of statement X or its 

15 complemeni, respectively, as a corresponding guard GUI and GU2, and perform a logic 
AND of the guard and the operation valid index In case of unit 201 the logic AND will 
produce ^true" as a result, while in case of unit 205 tiie logic AND will produce ^^false" as a 
result^ since tiie guards GU 1 and GU2 are equal to true and false, respectively. While 
statements BO, Bl, B2, CI or C2 are executed by execution units EXl and BX2 resfpectively, 

20 the results of the logic AND are clocked through the registers 209, 21 1 and 213. Both for 

execution unit EXl and EX2 the corresponding output valid indices OVl and OV2 are equal 
to tme. Unit 203 will perform a logic AND of the operation valid OVl and the result of the 
logic AND performed by unit 201 . The result of this logic AND will be true, and therefore 
result valid index RVl is equal to true. Via partially connected network CN, the value of 

25 result valid index RVl as well as the corresponding result data KDl are transferred to 

multiplexers MPl and MP2. Using the write select index WSl, the multiplexer MPl selects 
the ii^ut channel corresponding to result data RDl. The write enable index WEI is 
subsequentiy set to tme using result valid index RVl, and the result data RDl are written to 
register file segment RFl as data WDl. Unit 207 will perform a logic AND of the operation 

30 valid OV2 and the result of the logic AND performed by unit 205. The result of this logic 
AND wiU be false, and therefore result valid index RV2 is equal to &]se. Via partially 
connected netwotk CN, the value of result valid index RV2 as well as the result data RD2 are 
transferred to multiplexers MPl and MP2. Using the write select index WS2, the multiplexer 
MP2 selects the channel corresponding to result data RD2, The write enable index WE2 is 
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subsequently set to felse using result vaHd index RV2, and so the result data BD2 are not 
■written to register ffle segment RF2. Alternatively, liie value of guard X and its complement 
can be stored in both raster file segment RFl and register file segment RF2. Now 
statements BO, Bl, B2, CO and CI can be executed by both execution unit EXl and execution 
unit EX2. In case execution unit EXl or EX2 is executing statements BO, Bl or B2 the value 
of X is used for guard GUI or GU2, respectively. If execution unit EXl or EX2 is executing 
statemems CO or CI the complement of X is used for guard GUI or GU2, respectively. As a 
result, when executing statements BO, Bl or B2 the result date RDl or RD2 are written to 
register file segment KFl and/or RF2. If statements CO or CI ate executed, the result data 
RDl or RD2 are not written to register file segment RFl and/or RF2. 

Below another example of a piece of program code is shown, that should be 
executed by a time-stationary processor according to the invention. In this program code flie 
letters Z, P and Q refer to variables and X to a condition that can either be felse or true. When 
executing flris program ftagment, the value of P and Q are added, and the result is assigned to 
Z, if condition X is equal to true. 



if (X) then 

{ 

Z = add (P, Q); 
> 



The program code can be executed by a processor according to Fig. 1 as 
follows. The program code is converted by the cotnpil&c and Ihe add operation is replaced by 
a conditional add operation, cadd, taking the value of condition X as an additional argument: 



Z = cadd (K, P, Q); 
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Refening to Fig. 1, an instruction is executed by either execution unit EXl or 
EX 2 to determine the value of condition X. This instruction produces the result 'true'% and 
this result is stored in register file segment KFl. The value of parameters P and Q are stored 
S in register file segm^tKF 1 as well. The cadd instn^tionlFexecuted by execution unit EXi . 
The value of condition X, as well as parameters P and Q are received as input data ID by 
execution unit EXl. During execution of instruction cadd, the value of condition X is 
evaluated by execution unit EXl and if this value is equal to true. Hie output valid index OVl 
is set equal to tme. In case the value of condition X is equal to £alse, the output valid index 

10 OVl is set equal to false. In this example, the value of condition X is equal to tme, and 
therefore the value of output valid index OVl is set equal to trae as well. Furthermore, 
execution unit EXl calculates the value of parameter Z. Unit 101 performs a logic and on the 
operation valid index OPVl corresponding to instruction cadd and the output valid index 
OVl. Since the operation valid index OPVl is equal to true, the resulting result valid index 

IS RVl is equal to tme as well. The result valid index RVl and the result data KDl, in the form 
of the value of parameter Z, are transferred to multiplexers MPl and MP2 via partially 
connected network CN. Using write select index WSl, multiplexer MPl selects the channel 
corresponding to result data RDl as input channel. Multiplexer MPl sets the write enable 
index WEI equal to tme using result valid index RVl, and the value of parameter Z is 

20 written to register file segment RFl as write data WDl. In case the condition X is equal to 
fEilse, the ou^ut valid index OVl is set to false by execution unit EXl. The lo^c AND 
performed by unit 101 results in a result valid index RVl equal to &lse. As aresult, the write 
enable index WEI is set to filse. In this case Ihe value of parameter Z is not written to 
register file segment RFl. 

25 The above examples show that the conditional execution of operations in time- 

stationary processors without the use of jtunp operations can be implemented, by 
dynamical^ controlling the transfer of result data from an execution unit to a register file. 

In another embodiment the ccnnmunication network CN may be a partially 
connected communication network, ie. not every execution unit EXl and EX2 is coupled to 

30 all register file segments RFl and RF2. In case of a large number of execution units, the 
overhead of a fiilly coimected coromunication network wiU be considerable in terms of 
silicon area, delay and power consumptiosL During design of the VLIW processor it is 
decided to which degree llie execution units are coupled to the register file segments, 
depending on the range of applications that has to be executed 
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In another embodiment the distributed register file, comprising register file 
segments RFl and RF2, is a single registo: file. In case the number of execution units of a 
VUW processor is relatively small, the overhead of a single register file is relatively small as 
well. 

In another embodiment, the VLIW processor may have more execution units. 
The number of execution units depends on the ^e of plications that the VLIW processOT 
has to execute, amongst others. The processor may also have more register file segments, 

connected to said execution units: 

In another embodiment^ the execution units EXl and EX2 may have multiple 
inputs and/or multiple ou^uts, depending on the type of operations that the execution units 
have to perform, i.e. operations that requhe mote than two operands and/or produce more 
than one result. The register file may also have multiple read and/or write ports per register 
file segment. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that tiiose skilled in tiie art will be able to design many alternative 
embodiments without departing ficom the scope of the s^ended claims. In the claims, any 
reference signs placed between parentiieses shall not be construed as limiting the claim.. The 

■ I 

word "con^jrising" does not exclude the presence of elements or steps other than those listed j 
in a claim. The word "a" or "an" preceding an element does not exclude the pres^ce of a 
plurality of such elements. In the device claim enumerating several means, several of these 
means can be embodied by one and the same item of hardware. The mere fact that certain 
measures are recited in mutually different dependent claims does not indicate that a 
combination of tiiese measures cannot be used to advantage. 
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(XAIMS: 



1^ A time-stationary processor arranged for execution of a program. Hie processor 



comprising: 

a plurality of execution units; 
a register file accessible by the execution units; 
5 . a communication networlc for coupling the execution units and the register 

file; 

- a controller arranged for controlling the processor based on control 

information derived from the program, 

characterized in that the processor is furfher arranged to dynamically control the transfer of 
10 r^ult date from an execution unit of the plurality of execution units to the register file, based 
on the control information. 

* i 

2. A processor according to claim 1, characterized in that the control information 

coiiqmses a M identffier on liie vaHdily of an operation, and wherein 
1 5 ananged to dynamically control writing of result data corresponding to the operation into the;' 
register file, based on the first identifier. 



3 A processor according to claim 2, characterized in that the first identifier is 

delayed according to the pipeline of the corresponding execution unit arranged for executing 
20 the operation. 

4. A processor according to claim 1 , characterized in that the execution unit is 

arranged to produce a second identifier on the validity of an ou^ut result of a corresponding 
output port of the execution unit, and wherein the processor is fiirther arranged to 
25 dynamically ctmtrol writing of result data corresponding to the operation into the register file, 
based on both the first identifier and the second identifier. 
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S. A processor according to claim 4, characterized in fhat Ihe jurocessor is further 

arranged to dynamically control writing of result data corresponding to the operation into the 
register file, based on the first identifier, the second identifier and an input datum. 

5 6. A^bcessor according to claim 1^ cfiarac^^ 

distributed register file. 

7. A processor according to claim 1, characterized in that the conmiunication 
network is a partially connected communication network. 

10 

8. A method for controlling a time-stationary processor arranged for execution of 
a program, wherein the processor comprises: 

a plurality of execution units; 
a register file accessible by the execution units; 
1 5 - a communication network for coupling the execution units and the register 

file; 

a controller arranged for controlling the processor based on control 
information derived firom the program, 

characterized in that the method for controlling comprises the step of djmamically controlling 
20 the transfer of result data from an execution unit of the plurality of escecution units to the 
register file, using the control information. 
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In case of time-statiofnaiy encoding, every instraction that is part of the 
processor's instruction-set controls a complete set of operations that have to be executed m a 
single machine cycle. These operations may be processing several different data items 
traversing the data pipeline. Time-stationary encoding is often used in a^jpHcation-specific 
processors, since it saves the overhead of hardware necessary for delaying the control 
information present in the instructions, at the expense of larger code size. A disadvantage of 
time-stationary encoding is that is does not support conditional operations. The invention 
proposes to dynamically control the write back of result data to the register ffle of the time- 
stationary processor, using control information obtained by tiie program. By controlling the 
write bade of data at run-time, conditional operations can be implemented by a time- 
stationary processor. 



Fig. 2 



PHNL030384 





This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the appUcant. 

Defects in the images include but are not limited to the items checked: 



BLURRED OR ILLEGIBLE TEXT OR DRAWING 

I 

4 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




BLACK BORDERS 



□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 



□ FADED TEXT OR DRAWING 




